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Abstract 

We consider Kingman's partition structures which are regenerative with respect to a general operation 
of random deletion of some part. Prototypes of this class are the Ewens partition structures which 
Kingman characterised by regeneration after deletion of a part chosen by size-biased sampling. We 
associate each regenerative partition structure with a corresponding regenerative composition structure, 
which (as we showed in a previous paper) can be associated in turn with a regenerative random subset 
of the positive halfline, that is the closed range of a subordinator. A general regenerative partition 
structure is thus represented in terms of the Laplace exponent of an associated subordinator. We also 
analyse deletion properties characteristic of the two-parameter family of partition structures. 
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1 Introduction and main results 



This paper is concerned with probability distributions for a random partition 7r„ of a positive integer n. 
We may represent 7r„ as a sequence of integer- valued random variables 

tt„ = (7r n ,i,7r„ j2 , . . .) with 7r n! i > 7r„ :2 > ■ • • > 

so TT n: i is the size of the ith largest part of Tt n , and 7r„.j = n. We may also treat 7r„ as a multiset of 
positive integers with sum n, regarding n n as a random allocation of n unlabeled balls into some random 
number of unlabeled boxes, with each box containing at least one ball. We call n n regenerative if it is 
possible to delete a single box of balls from 7r„ in such a way that for each 1 < x < n, given the deleted 
box contained x balls, the remaining partition of n — x balls is distributed as if x balls had been deleted 
from 7r„ by uniform random sampling without replacement. 

To be more precise, we assume that 7r„ is defined on some probability space (f2, T , P) which is rich 
enough to allow various further randomisations considered below, including the choice of some random 
part X n G 7r n , meaning that X n is one of the positive integers in the multiset 7r„ with sum n. The 
distribution of 7r„ is then specified by some partition probability function 

p(X) := P(tt„ = A) (Ah n) (1) 

where the notation A h n indicates that A is a partition of n. The joint distribution of 7r n and X n is 
determined by the partition probability function p and some deletion kernel d = d(X, x), A \~ n, 1 < x < n, 
which describes the conditional distribution of X n given ir n , according to the formula 

p(X)d(\,x) =P(tt„ = X,X n =x). (2) 

The requirement that X n is a part of n n makes d(X, x) = unless £ is a part of A, and 

Y d d{\x) = l (3) 

for all partitions A of n. Without loss of generality, we suppose further that ir n is the sequence of ranked 
sizes of classes of some random partition IT„ of the set [n] := {1, . . . , n}, where conditionally given n n all 
possible values of ±I n are equally likely. Equivalently, H n is an exchangeable random partition of [n] as 
defined in . For 1 < m < n let Ii m be the restriction of II„ to [m] , and let 7r TO be the sequence of 
ranked sizes of classes of n m . We say that the random partition 7r TO of m is derived from n n by random 
sampling, and call the distributions of the random partitions Tr m for 1 < m < n sampling consistent. A 
partition structure is a function p(X) as in Q for a sampling consistent sequence of distributions of TT n for 
n = 1, 2, . . .. This concept was introduced by Kingman j9j, who established a one-to-one correspondence 
between partition structures p and distributions for a sequence of nonnegative random variables Vi, V2 , • • ■ 
with V\ > V2 > ■ ■ ■ and Vi < 1. In Kingman's paintbox representation of p, the random partition 7r n 
of n is constructed as follows from (14) and a sequence of independent random variables Ui with uniform 
distribution on [0, 1], where (Ui) and (14) are independent: 7r n as in is defined to be the sequence of 
ranked sizes of blocks of the partition of [n] generated by a random equivalence relation ~ on positive 
integers, with i ~ j iff either i = j or both Ui and Uj fall in 1^ for some k, where the Ik are some disjoint 
random sub-intervals of [0, 1] of lengths 14- See also ^2] and papers cited there for further background. 

Definition 1 Call a random partition 7r„ of n regenerative, if it is possible to select a random part X n 
of 7r„ in such a way that for each 1 < x < n, conditionally given that X n — x the remaining partition 
of n — x is distributed according to the unconditional distribution of ir n - x derived from 7r„ by random 
sampling. Then 7r„ may also be called regenerative with respect to deletion of X n , or regenerative with 
respect to d if the conditional law of X n given 7r„ is specified by a deletion kernel d as in . Call a 
partition structure p regenerative if the corresponding n n is regenerative for each n — 1,2,.... 

According to this definition, ir n is regenerative with respect to deletion of some part X n £ 7r„ if and 
only if for each partition A of n and each part x G A, 

P(7r„ - A, X n = x)= F(X n = x) F(n n _ x = X - {x}) (X h n) (4) 
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where A — {x} is the partition of n — x obtained by deleting the part x from A, and ir n - x is derived from 
7r n by sampling. Put another way, 7r n is regenerative with respect to a deletion kernel d iff 

p(X)d(X, x) — q(n, x)p(X — {x}) , x 6 A (A h n) (5) 

where := F(7r m = /i) for /ihm and 1 < m < n and 

q{n,x):= ^ d(A, ai)p(A) = F(X n = x) (l<x<n) (6) 

{Ah?l : x£\} 

is the unconditional probability that the deletion rule removes a part of size x from 7r„. 

A well known partition structure is obtained by letting n n be the partition of n generated by the sizes 
of cycles of a uniformly distributed random permutation a n of [n] . If X n is the size of the cycle of a n 
containing 1, then 7r„ is regenerative with respect to deletion of X n , because given X n = x the remaining 
partition of n — x is generated by the cycles of a uniform random permutation of a set of size n — x. In 
this example, the unconditional distribution q{n 1 •) of X n is uniform on [n\. The deletion kernel is 

rf (A,x) = ^^ (Ahn) (7) 
n 

where A is a partition of n and a x (X) is the number of parts of A of size x, so Y] t xa x (X) = n. More 
generally, a part X n is chosen from a random partition 7r n of n according to Q may be called a size-biased 
part of 7r n . According to a well known result of Kingman if a partition structure is regenerative with 
respect to deletion of a size-biased part, then it is governed by the Ewens sampling formula 

»« = (5wII^ <»> 

for some parameter 9 > 0, where A is encoded by its multiplicities a r — a r (X) for r — 1, 2, . . ., with 

£ = £ a r , n = £ ra r (9) 

and 

n 

(0) nT 6 :=n^+(*- 1 ) 6 )- 

1=1 

The case 9 = 1 gives the distribution of the partition generated by cycles of a uniform random permuta- 
tion. Pitman (101 II 1| introduced a two-parameter extension of the Ewens family of partition structures, 
defined by the sampling formula 

for suitable parameters (a, 9), including 

{(a,6) : < a < 1, 6 > 0} (11) 

where boundary cases are defined by continuity. See | 14| for a review of various applications of this 
formula. The result of [31 Theorem 8.1 and Corollary 8.2] shows that each (a, 9) partition structure with 
parameters subject to (II II) is regenerative with respect to the deletion kernel 

, n A Qr (n-r)T + r(l - r) 

d(A ' r) " l-r+(i-l)r ' (12) 
where t = a/(a + 9) G [0, 1], and © follows easily from 10. In Section|21we establish: 

Theorem 2 For eac/i t £ [0, 1], i/ie only partition structures which are regenerative with respect to the 
deletion kernel 1(12(1 are the (a, 9) partition structures subject to 1(11(1 with a /(a + 9) = t. 

The following three cases are of special interest: 
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Size-biased deletion This is the case r = 0: each part r is selected with probability proportional 
to r. Here, and in following descriptions, we assume that the parts of a partition are labeled in some 
arbitrary way, to distinguish parts of equal size. In particular, if n n is the partition of n derived from 
an exchangeable random partition Tl n of [n], then for each i G [n] the size X n (i) of the part of n„ 
containing i defines a size-biased pick from the parts of ir n . Theorem|2|in this case reduces to Kingman's 
characterisation of the Ewens family of (0,6*) partition structures. Section [7| compares Theorem [21 with 
another characterisation of (a, 9) partition structures provided by Pitman |12| in terms of a size-biased 
random permutation of parts defined by iterated size-biased deletion. 

Unbiased (uniform) deletion This is the case r = 1/2: given that 7r n has £ parts, each part is 
chosen with probability l/£. Iteration of this operation puts the parts of 7r„ in an exchangeable random 
order. In this case, the conclusion of Theorem|21is that the (a, a) partition structures for < a < 1 are 
the only partition structures invariant under uniform deletion. This conclusion can also be drawn from 
Theorem 10.1 of [5]. As shown in |11U13| . the (a, a) partition structures are generated by sampling from 
the interval partition of [0, 1] into excursion intervals of a Bessel bridge of dimension 2 — 2a. The case 
a = 1/2 corresponds to excursions of a standard Brownian bridge. 

Cosize-biased deletion In the case r = 1, each part of size r is selected with probability proportional 
to the size n — r of the remaining partition. The conclusion of Theorem [21 in this case is that the (a, 0) 
partition structures for < a < 1 are the only partition structures invariant under this operation. 
As shown in [111 I13j. these partition structures are generated by sampling from the interval partition 
generated by excursion intervals of an unconditioned Bessel process of dimension 2 — 2a. The case a = 1/2 
corresponds to excursions of a standard Brownian motion. 

The next theorem, which is proved in Section [3| puts Theorem [3 in a more general context: 

Theorem 3 For each probability distribution q(n, ■ ) on [n], there exists a unique joint distribution of 
a random partition 7r„ of n and a random part X n of ~K n such that X n has distribution q(n, •) and n n is 
regenerative with respect to deletion of X n . 

Let 7r m , 1 < m < n be derived from ir n by random sampling. Then for each 1 < m < n the random 
partition ir m is regenerative with respect to deletion of some part X m , whose distribution q(m, •) is that of 
H m given H m > ; where H m is the number of balls in the sample of size m which fall in some particular 
box containing X n balls in 7r„ . 

The main point of this theorem is its implication that if ir n is regenerative with respect to deletion of X n 
according to some deletion kernel d(A, •), which might be defined in the first instance only for partitions 
A of n, then there is for each 1 < m < n an essentially unique way to construct d(A, •) for partitions A of 
m, so that formula © holds also for m instead of n. Iterated deletion of parts of 7r„ according to this 
extended deletion kernel puts the parts of ir n in a particular random order, call it the order of deletion 
according to d. This defines a random composition of n, that is a sequence of strictly positive integer 
random variables (of random length) with sum n. We may represent such a random composition of n as 
an infinite sequence of random variables, by padding with zeros. The various distributions involved in 
this representation of ir n are spelled out in the following corollary, which follows easily from the theorem. 

Corollary 4 In the setting of the preceding theorem, 

(i) for each 1 < m < n the distribution q(m, •) of H m is derived from q(n, •) by the formula 



q(m,k) 



qo(m, k) 



(1 < k < m) 



(13) 



1 - q {m, 0) 



where 




(0 < k < m). 
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(ii) Let X n ,i, X Ul 2, ... be a sequence of non-negative integer valued random variables such that X n ,i has 
distribution q(n, ■), and for j > 1 

F{X nd+1 = • | X nA + ■■■ + X n>j =r)= q(n - r, •) (14) 

with X n j + i = if X n ^i + • • • + X n j = n, so X n := (X n _i, X n _2, ■ ■ ■) is a random composition of n 
with 

£ 

P{X n = X) = Y[q(X J + --- + X e ,X J ) (15) 

3=1 

for each composition X of n with I parts of sizes Ax, A2, . . . , Xg. Then (n n ,X n ) with the joint 
distribution described by Theorem [21 can be constructed as follows: let X n = X n _i and define 7r n by 
ranking X n . 

(iii) For each 1 < m < n the distribution of n rn is given by the formula 

i 

P(vr m = A) = q ( X °ii) +'•■ + X «W> X «U)) ( 16 ) 

a j=l 

where X is a partition of m into I parts of sizes X\ > X2 > ■ ■ • > Xg > 0, and the summation extends 
over all ml/ Y[dj(X)\ distinct permutations a of the I parts of X, with cij(X) being the number of 
parts of X of size j . 

(iv) Let d(X, x) for partitions X of m < n and x a part of X be derived from q and p via formula l|S"|. 
and let X n be the random composition of n defined by the parts of 7r„ in order of deletion according 
to d. Then X n has the distribution described in part (ii). 

Following HH, we call a transition probability matrix q(m,j) indexed by 1 < j < m < n, with 
Sj=i 9( m i i) = 1> a decrement matrix. A random composition of n generated by q is a sequence of 
random variables X n := (A"„ i, X n> 2, . . .) with distribution defined as in part (ii) of the previous corollary. 
Hoppe 6 called this scheme for generating a random composition of n a discrete residual allocation 
model. 

Suppose now that X n is the sequence of sizes of classes in a random ordered partition ±1„ of the set 
[n], meaning a sequence of disjoint non-empty sets whose union is [n], and that conditionally given X n all 
possible choices of II n are equally likely. Let X m be the sequence of sizes of classes of the ordered partition 
of [m] defined by restriction of II ra to [m] . Then the X m is said to be derived from X n by sampling, and 
the sequence of distributions of X m is called sampling consistent. A composition structure is a sampling 
consistent sequence of distributions of compositions X n of n for n = 1, 2, . . .. 

Definition 5 Following we call a random composition X n = (X n i,X n 2, ■ ■ ■) of n regenerative, if 
for each 1 < x < n, conditionally given that X n \ = x the remaining composition (X n 2, ■ ■ ■) of n — x is 
distributed according to the unconditional distribution of X n — X derived from X n by random sampling. 
Call a composition structure (X n ) regenerative if X n is regenerative for each n = 1,2,.... 

Note the close parallel between this definition of regenerative compositions and Definitionnof regenerative 
partitions. The regenerative property of a random partition is more subtle, because it involves random 
selection of some part to delete, and this selection process is allowed to be as general as possible, while 
for random compositions it is simply the first part that is deleted. The relation between the two concepts 
is provided by the following further corollary of Theorem |3| 

Corollary 6 If the parts of a regenerative partition ir n ofn are put in deletion order to define a random 
composition of X n of n, as in part (iv) of the previous corollary, then X n is a regenerative composition 
ofn. 

This reduces the study of regenerative partitions to that of regenerative compositions, for which a 
rather complete theory has already been presented in In particular, the basic results of [3], recalled 
here in Section [SJ provide an explicit paintbox representation of regenerative partition structures, along 
with an integral representation of corresponding decrement matrices q. See also Section 0] for some 
variants of Corollary |SJ 
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2 Proof of Theorem [2] 

This is an extension of the argument of Kingman [Sj in the case r = 0. Recall first that when partitions 
A are encoded by their multiplicities, a r — a r (X) for r — 1,2,..., the sampling consistency condition on 
a partition probability function p is expressed by the formula 

i \ i -, N ai + 1 r(a r + 1) , . 

p(a 1 ,a 2 , ■■■)= P(ai + 1, ct2, ■ ■ •) — +>»(..., a r _i - 1, a r + 1, . . .) — — (17) 

n + 1 ^— ' ?i + l 

where p is assumed to vanish except when its arguments are non-negative integers, and n = ^ r ra r . 
Assuming that p is a regenerative with respect to d, iterating © we have for parts r, s G A, 

/u g(n,r) g(n-r,5) 

p(A) = d[M^j p(A - {r ' s}) ' (18) 

which can clearly be expanded further. Since this expression is invariant under permutations of the parts, 
interchanging r and s we get 

q(n,r) q(n — r,s) q(n,s) q(n — s,r) 



d(X, r) d{\ - {r}, s) d{\, s) d(X - {s}, r) ' 
Assume now that d is given by l|12f) . Introducing 

q(n, r)n 



b(n, r) 



(n — r)r + r(l — r) 



formula l|18l) yields b(n, r)b(n — r, s) — b(n, s)b(n — s, r). Taking s — 1 and abbreviating f(n) := b(n, 1) 
we obtain b(n, r)/b(n —1,7") = f(n)/f(n — r), thus 

b(n, r) = f{n - r + 1) • • • f(n - l)f(n)g(r) , for g(r) := ■ ' . 

The full expansion of p now reads 

pw = iia - r + z r) n /(fc) n ^f- 

i=0 fe=i r r ' 

where et r is the number of parts of A of size r, with Y,a r — £ and £ ra r = n. By homogeneity we can 
choose the normalisation g(l) = 1. Assuming that p is a partition structure, substituting into (|17|) and 
cancelling common terms gives 

- (1 -T + /r) + ^ra r _ r 



/(n+l) v ' £i r -g(r-lY 



Now defining h{r) by the substitution 

g(r) _ _t r - 1 
g(r — 1) r r 

we obtain 



h(r) 



" + X jT = 1 - t + ]T(r - l)a r ^h(r) 



fin , . 

which must hold for arbitrary partitions, hence h(r) — 7 for some constant. Therefore 

1 — t + (77 — 1)7 r! 
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It follows that 

PiX) - (l-r) nT7 1H H J £1 
which is positive for all A iff 7 > r. The substitution 

t I — T 
a = - , 6> = 

7 7 

reduces this expression to the two-parameter formula Q10[l. and Theorem [3 follows. 



3 Fragmented permutations 

We use the term fragmented permutation of [n] for a pair 7 = (er, A) 6 SS n x C„, where SS n is the set of 
all permutations of [n], and C' n is the set of all compositions of n. We interpret a fragmented permutation 
7 as a way to first arrange n balls labeled by [n] in a sequence, then fragment this sequence into some 
number of boxes. We may represent a fragmented permutation in an obvious way, e.g. 

7 = 2,3,9|1,8|6,7,5|4 

describes the configuration with balls 2, 3 and 9 in that order in the first box, balls 1 and 8 in that order 
in the second box, and so on, that is 7 = (a, A) for a = (2, 3, 9, 1, 8, 6, 7, 5, 4) and A = (3, 2, 3, 1). 

We now define a transition probability matrix on the set of all fragmented permutations of [n\. We 
assume that some probability distribution q(n, •) is specified on [n\. Given some initial fragmented 
permutation 7, 

• let X n be a random variable with distribution q(n, •), meaning 

P(X n = x) = q(n, x), 1 < x < n; 

• given X n — x, pick a sequence of x different balls uniformly at random from the 

n(n — 1) ■ • • (n — x + 1) 

possible sequences; 

• remove these x balls from their boxes and put them in order in a new box to the left of the remaining 
n — x balls in boxes. 

To illustrate for n — 9, if the initial fragmented permutation is 7 = 2, 3, 9 | 1, 8 | 6, 7, 5 | 4 as above, Xg — 4 
and the sequence of balls chosen is (7,4,8, 1), then the new fragmented permutation is 

7,4,8, 1 1 2, 3, 9 I 6, 5. 



Definition 7 Call the Markov chain with this transition mechanism the q(n, -)-chain on fragmented 
permutations of [n] . 

To prepare for the next definition, we recall a basic method of transformation of transition probability 
functions. Let Q be a transition probability matrix on a finite set S, and let / : S — > T be a surjection 
from S onto some other finite set T. Suppose that the Q{s, •) distribution of / depends only on the value 
of f(s), that is 

Y, Q(s,x) = Q(f(s),t), (teT) (19) 
for some matrix Q on T. The following consequences of this condition are elementary and well known: 
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• if (Y n ,n — 0,1,2,...) is a Markov chain with transition matrix Q and starting state xq, then 
(f(Y n ), n = 0, 1, 2, . . .) is a Markov chain with transition matrix Q and starting state f(xo)', 

• if Q has a unique invariant probability measure 7r, then Q has unique invariant probability measure 
tt which is the tt distribution of /. 

To decribe this situation, we may say that Q is the push-forward of Q by f. 

Definit ion 8 The q(n, -)-chain on permutations of [n] is the q(n, -)-chain on fragmented permutations 
of [ri] pushed forward by projection from (<r, A) to a. Similarly, pushing forward from (er, A) to A defines 
the q(n, -)-chain on compositions of n and pushing forward further from compositions to partitions, by 
ranking, defines the q{n 1 -)-chain on partitions of n. 

The q(n, -)-chain on permutations of [n] is the inverse of the top-to-random shuffle studied by Diaconis, 
Fill and Pitman [2] . Keeping track of packets of cards in this shuffle leads naturally to the richer state 
space of fragmented permutations. The mechanism of the q(n, -)-chain on compositions of n is identical 
to that described above for fragmented permutations, except that the labels of the balls are ignored. The 
mechanism of the q{n, -)-chain on partitions of n is obtained by further ignoring the order of boxes in the 
composition. The following lemma connects these Markov chains to the basic definitions of regenerative 
partitions and regenerative compositions which we made in Section ^ 

Lemma 9 Letq(n,-) be a probability distribution on [n]. Then 

(i) a random composition X n = (X nt i, X nj 2, ■ ■ ■)> with X n ^\ distributed according to q{n,-), is regen- 
erative if and only if the distribution of X n is an invariant distribution for the q(n, -)-chain on 
compositions of n, 

(ii) a random partition ir n is regenerative with respect to deletion of some part X n with distribution 
q(n, •) if and only if the distribution of ir n is an invariant distribution for the q(n, -)-chain on 
partitions of n. 

Proof. The proofs of the two cases are similar, so we provide details only in case (ii). The condition that 
7r„ is regenerative with respect to deletion of X n can be written as follows: 

TTn — {X n } = TTn-X n 

where 

(a) = denotes equality in distribution of two random elements of the set of partitions of m for some 
< m < n, allowing a trivial partition of 0. 

(b) on the left side 7r„ — {X n } denotes the random partition of n — X n derived from n n by deletion of 
the part X n of n ni 

(c) on the right side {jr m , < m < n) is a sampling consistent sequence of random partitions, indepen- 
dent Of X n , with 7T„ = 7T„. 

Consider the random partition 

7T* := TT n -x„ U {X n } 

obtained from 7r„ by first removing X n balls from 7r„ by random sampling, then putting all these balls 
in a new box. The conditional distribution of 7r* given 7r ra defines a transition probability matrix on 
the set of partitions of n, which is the transition matrix of the q(n, -)-chain on partitions of n. If 7r„ is 

regenerative with respect to deletion of X n , then 7r„ = 7r„ = 7r*. That is to say, the distribution of ir n 
is an invariant probability measure for the q(n, -)-chain on partitions of n. Conversely, if the distribution 

of 7r n is invariant for the q(n, -)-chain on partitions of n, so 7r n = 7r*, we can set tt„ := 7r*, and then by 
construction 

7T„ - {X n } — 7T* - {X n } — 7T„_X„- 

So 7r n is regenerative with respect to deletion of X n with distribution q{n, •). □ 
Theorem |21 and its corollaries now follow easily from the previous lemma and the following lemma: 
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Lemma 10 For each probability distribution q(n, •) on[n], the q(n, ■)- chain on fragmented permutations 
of [n] has a unique stationary distribution. Under this distribution, 

(i) the permutation of [n] has uniform distribution; 

(ii) the permutation and the composition are independent; 

(iii) the composition of n is generated by q according to Corollary^l 

Hence, the distribution on compositions described by Corollary 0] (ii) is the unique stationary probability 
distribution for the q(n, •)- chain on compositions of n, and distribution of 7r„ described by formula (lltjfl 
is the unique stationary probability distribution for the q(n, -)-chain on partitions of n. 

Proof. Let m := max{i : q(n, x) > 0}, and write n = km + r for positive integers k and r with 1 < r < m. 
We argue that whatever the initial state 7, there is a strictly positive probability that after k + 1 steps 
the q(n, -)-chain on fragmented permutations reaches the state 

1, 2, . . . , m\ ••• I (k — l)m + 1, . . . , km \ km + 1, . . . , n 

in which the permutation is the identity and the composition is (m, • • • , m, r). To see this, note that the 
transition mechanism ensures that after one step from 7 it is possible to reach a state of the form 

n — m + 1, n — m + 2, . . . , n | 

for some determined by the initial configuration 7. Then after two steps it is possible to reach a 

state of the form 

(k — 1)to + 1, . . . , km I km + 1, . . . , 77 

for some , and so on. Since there is a state which can be reached eventually no matter what the 

initial state, it follows from the elementary theory of Markov chains that a stationary distribution exists 
and is unique. 

Let Pq( n ,-) denote the push-forward of this stationary distribution to compositions of 77, that is the 
stationary distribution of the q(n, -)-chain on compositions of 77. By definition of the q(n, -)-chain on 
fragmented permutations, 

• under P q ( n ,-) the number of balls in the first box has distribution q(n, •); 

• under P q r n A, for each 1 < 777 < 77, given that the first box contains 77 — 777 balls, the remaining 
composition of 777 has the distribution on compositions of m derived from P q ( n ,A by taking a random 
sample of m out of the 77 balls, to be denoted (P g („ i .))"^ m . 

To complete the proof of the lemma, it just remains to check the key identity 

(P q (n,)) n ^ m = P,k-) (20) 

where q(m,-) is derived from q(n,-) by formula (|13|l . Due to independence of the composition and 
the permutation, a composition of m with distribution (P q ( n ,-)) n ^' m is obtained from the stationary 
distribution of the q(n, -)-chain on fragmented permutations by ignoring balls 777+I, . . . , 77, and considering 
the composition of m induced by the balls labeled by [m] . But when the fragmented permutation evolves 
according to the q{n, -)-chain, it is clear that at each step, no matter what the initial state, for < s < 777, 
the probability that exactly s of the m balls get moved to the left is qo(m, s) as in (|13|l . Let q(m, ■) be as in 
formula Since the probability of moving at least one of the first 777 balls is 1— qo(m, 0), no matter what 
the initial state, the q{n, -)-chain on fragmented permutations of [77] pushes forward to a Markov chain on 
fragmented permutations of [m] . The transition matrix of this chain is a mixture of the identity matrix 
and the matrix of q(m, -)-chain on fragmented permutations of [m], with weights qo(m, 0) and 1 — qo{m, 0) 
respectively. Hence the equilibrium distribution of this chain with state space fragmented permutations 
of [777] is identical to the equilibrium distribution of the q(m, -)-chain on fragmented permutations of [777], 
whose projection onto compositions of [m] is P q r m> ,\. This proves (|20|l . □ 
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4 Some corollaries 



This section spells out some further corollaries of Theorem [31 

Corollary 11 The distribution of every regenerative partition ir n of n is obtained by ranking the com- 
ponents of some regenerative composition X n of n, whose distribution is uniquely determined by that of 
7r„ . Then 7r„ is regenerative with respect to deletion of X n distributed like the first component of X n . This 
correspondence, made precise by the formulae of Corollary^] establishes bisections between the following 
sets of probability distributions: 

(i) probability distributions q{n, •) on [n] 

(ii) distributions of regenerative compostions X n of n 

(iii) distributions of regenerative partitions 7r„ on n. 

An explicit link from (iii) to (i) is provided by a recursive formula 3, Equation (34)] expressing q(n, ■) via 
the probabilities of one-part partitions (p(j), j = 1, • ■ • , n). There is also an explicit formula expressing 
q(n, •) as a rational function of probabilities (p(l m ), m — 1, . . . ,n) where l m h m is the partition with 
only singleton parts. 

As a variant of the above corollary, we record also: 

Corollary 12 Given a decrement matrixq = (q(m,j), 1 < j < m < n) for some fixedn, for 1 < m < n 
let X m be the random composition of m generated by q, and ir m the random partition of m obtained by 
ranking X m . The following conditions are equivalent: 

(i) the entire decrement matrix q is determined by q(n, •) according to formula l|13fl: 

(ii) the sequence of compositions (X m , 1 < m < n) derived from q is sampling consistent; 

(iii) the sequence of partitions {it m , 1 < m < n) derived from q as in (|16|l is sampling consistent. 

The equivalence of (i) and (ii) can also be read from [3] , where formula l|13[) was given only for m = n — 1 
as a means of recursively computing q(m, •) from q(n, •) for m < n. That (ii) implies (iii) is obvious. 
That (iii) implies (ii) is not obvious, but this is an immediate consequence of Theorem [3J and Corollary 
01 because (iii) means that 7r n is regenerative with respect to deletion of the first term of X n . 

Corollary 13 The following two conditions on a random composition X n — (X n _i, A„.2, . . .) of n are 
equivalent: 

(i) X n is regenerative; 

(ii) for each 1 < j < k < n, conditionally given X n> \, . . . , X n ,j with X n _i + • • • + X n j = k, the partition 
of n — k obtained by ranking A„ J+ i, A„.j + 2, ■ ■ ■ has the same distribution as Tr n -k, the partition of 
n — k obtained by sampling from X n . 

Proof. That (i) implies (ii) is obvious. Conversely, condition (ii) for j = 1 states that the partition 
TT n derived from X n is regenerative with respect to deletion of X n ,i- Let q(n, •) be the distribution of 
X n> \, and let d denote the corresponding deletion kernel, extended to partitions of m for 1 < m < n in 
accordance with Theorem [21 Condition (ii) for j = 2 implies that for each i such that q(n,i) > 0, the 
partition 7r n _,; is regenerative with respect to deletion of a part whose unconditional distribution equals 
the conditional distribution of X n 2 given X n _i = i. According to the uniqueness statement of Theorem 
[31 this distribution must be the distribution q(n — i, ■) determined by q(n, ■) via l|13|l . So the joint law of 
(A„.i,A„ 2) is identical to that described in Corollary 01 (ii). Continuing in this way, it is clear that the 
distribution of the entire sequence X n is that described in Corollary 01 (ii) . The conclusion now follows 
from Corollary □ 
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5 Paintbox representations 



Gnedin's paintbox representation of composition structures [S] uses a random closed set 7Z C [0,1] to 
separate points of a uniform sample into clusters. Given 1Z, define an interval partition of [0, 1] comprised 
of gaps, that is open interval components of [0, 1] \ 1Z, and of individual points of 1Z. A random ordered 
partition of [n] is then constructed from 1Z and independent uniform sample points U\ , . . . , U n by grouping 
the indices of sample points which fall in the same gap, and letting the points which hit 1Z to be singletons. 
A random composition X n of n is then constructed as the sequence of block sizes in this partition of [n] , 
ordering the blocks from left to right, according to the location of the correspoding sample points in 
[0,1]. Gnedin showed that every composition structure (X n ) can be so represented. As in Kingman's 
representation of partition structures, 1Z can be interpreted as an asymptotic shape of X n , provided X n is 
properly encoded as an element of the metric space of closed subsets of [0, 1] with the Hausdorff distance 
function. 

According to the main result of [3], each regenerative composition structure (X n ) is associated in 
this way with an 1Z which is multiplicatively regenerative in the following sense: for t G [0, 1] let D t := 
inf([i, 1] n 72.), and given that D t < 1 let 

ft [Dt ' 1] := {{z - A)/(l - D t ), zelZD [D t , 1]} 

which is the restriction of 1Z to [D t , 1] scaled back to [0, 1]; then 

(K [Dt - 1] | D t with D t <l,Kn [0,D t ]) = 1Z (21) 

meaning that the conditional distribution of 1Z^ Dt,1 \ given D t with D t < 1 and given 7Z (~1 [0, Dt], is 
identical to the unconditional distribution of 1Z. This condition holds if and only if — log(l — 1Z) is a 
regenerative subset of [0,oo[ in the usual (additive) sense. So by a result of Maisonneuve, the most 
general multiplicatively regenerative subset 1Z can be constructed as the closure of {1 — exp(— S t ), t > 0} 
for some subordinator (St,t > 0). Thus regenerative composition structures are parameterised by a pair 
(y, d) where v is a measure on ]0, 1] with finite first moment and d > 0. The measure v(du) is the image 
of the Levy measure v(ds) of the subordinator via the transformation from s to 1 — exp(— s). and d is 
the drift parameter of the subordinator. So the Laplace exponent of the subordinator, evaluated at a 
positive integer n, is 

$(n) =nd+ [ (1 - (1 - x) n )v{dx) , 
J]o,i] 

The decrement matrix of the regenerative composition is then 

<£>(n, r) 

g(n,r) = ' , 1 < r < n, n = 1,2, . . . 
<P(n) 

where 

^(n,r)=ndl{r = l)+( n ) [ x r (l - x) n - r v{dx) . 
W ^]0,i] 

Uniqueness of the parametrisation is achieved by a normalisation condition, e.g. ^(l) = 1. 

The partition structure derived by sampling from a random closed subset 1Z of [0, 1] depends only on 
the distribution of the sequence of ranked lengths induced by 1Z 

V(1Z) := {Vx{1Z),V 2 {1Z),...) 

where Vi(TZ) is the length of the zth longest interval component of [0, 1] \ 1Z. Our consideration of regen- 
erative partition structures suggests the following definition. Call 1Z weakly multiplicatively regenerative 
if for each t E [0, 1] 

(V(K [DtA] ) | D t with D t <l,lZn[0,Dt]) = V(K) (22) 

meaning that the conditional distribution of relative ranked lengths induced by 72.fl [D t , 1], given D t with 
D t < 1 and given the restricted set 1Z fl [0, Dt], is identical to the unconditional distribution of ranked 
lengths induced by 7Z. From Theorem [3] we easily deduce: 
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Corollary 14 A random closed subset 1Z of [0, 1] is weakly multiplicatively regenerative if and only if 
it is multiplicatively regenerative. 

Proof. The "if" part is obvious by measurability of the map from 1Z to V(1Z). To argue the converse, 
suppose that TZ is weakly multiplicatively regenerative. Without loss of generality, it can be supposed 
that 1Z is defined on the same probability space as a sequence of independent uniform [0, 1] variables Ui 
for i = 1,2,.... Let X n — (X n _i, X n ji, ■ ■ ■) for n — 1, 2, ... be the sequence of compositions of n derived 
from 1Z by sampling with these independent uniform variables, and let 7r„ be the partition of n defined 
by ranking X n . By consideration of (|22|) with t replaced by U n ,i, where U n ,k is the kth order statistic of 
U\, . . . , U n , it is easily argued that 7r„ is regenerative with respect to deletion of X n ^\. If X n ^i < n, we 
apply IL'l'l) at t — U n- x„ i+i and repeat the argument to show that ir n — {X n i,X n2 \ is a distributional 
copy of 7r n _ m given X n> i and X n ^ with X n> i + X n ^ — m. Iterating further we see that X n satisfies 
condition (ii) of Corollary 1131 Hence X n is regenerative. Thus (X n ) defines a regenerative composition 
structure, and it follows the main result of 3 that the set 1Z is regenerative. □ 

6 Uniqueness and positivity 

For each n equation JSJ is the basic algebraic relation between the functions p(X), d(X) for A h n and 
q(n, •). According to Corollary 21 q(n, •) uniquely determines p, for each n, but d satisfying JSJ need not 
be unique because p(X) may assume zero value for some A, as illustrated by the following example. 

Example (Regenerative hook partition structures.) A partition A of n is called a hook if it has at most one 
part larger than 1. The only regenerative partition structures p such that ir n is a hook with probability 
one for every n are those which can be generated by v = Si, the Dirac mass at 1 and d > 0, including 
the trivial boundary case with d = oo. Then for n > 1 

Q{n,n) = — — -, q(n,l) = — — -. 

1 + nd 1 + nd 

This implies that the associated hook composition structure gives positive probability only to compositions 
of the type (n) or (l m , n—m), 1 < m < n, for every n. For these hook composition structures the deletion 
kernel is an arbitrary kernel with the property 

d(X, 1) = 1 if 1 € A. (23) 

This property is characteristic, that is each partition structure regenerative according to such deletion 
kernel d is derived from a hook composition structure. Indeed, if a composition structure compatible with 
such d gave positive probability to a composition (x, y, . . .) with x > 1 then, by sampling consistency, the 
composition (2, 1) would also have positive probability, contradicting q(3, 2) = 0. 

The next lemma shows that only in the hook case can there be any ambiguity about the deletion 
kernel generating some partition structure 

Lemma 15 Let (ir n ) be a regenerative partition structure. Then 

(i) either (ir n ) is a hook partition structure, 

(ii) or p > 0, d > and q > for all admissible values of arguments. 
Moreover, (i) holds iff p{2,2) = 0, while (ii) holds iff p(2,2) > 0. 

Proof. If Vi in Kingman's representation is strictly positive with nonzero probability, then p(n, n) > 
for all n. By virtue of p{n,n) — q(2n,n)p(n) follows q(2n,n) > 0, but then by Corollary ^i) q(n,-) > 
and this implies p > and d > 0. In this case p(2, 2) > 0. Alternatively, if P(V 2 = 0) = 1 then p(2, 2) = 
and we are in the hook case. □ 
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Thus, if d is not a kernel satisfying l|2"3")) . then d(X,x) = for some partition A and some part x 6 A 
implies that a partition structure regenerative according to d is trivial, {i.e. is either the pure-singleton 
partition with p(l n ) = 1, or the one-block partition with p(n) = 1). 

The positivity condition in lemma rules out nontrivial partition structures, which have an absolute 
bound on the number of parts for all n. For example, none of the members of the two-parameter family 
of partition structures (fTOf) is regenerative for a < and —6 /a = k € {2, 3, . . .} because each 7r„ has at 
most k parts. 

Corollary 16 If a partition structure is regenerative and satisfies p(2, 2) > then q uniquely deter- 
mines p and d, and p uniquely determines q and d. Thus if a regenerative partition structure is not hook 
the corresponding deletion kernel is unique. 

Checking if a given partition structure is regenerative according to an unknown deletion kernel can be 
done by first computing q, by some algebraic manipulations, from a given partition probability function 
p, then computing a partition probability function for the regenerative partition structure related to 
this q, and finally checking if p = p„. When this method is applied to a partition from the two-parameter 
family with < a < 1, —a < 9 < 0, the resulting q recorded in J5J Equation (39)] is not everywhere 
positive, hence the partition structures with such parameters are not regenerative. 

7 Generalisations and related work 

Given some deletion kernel d, it is of interest to consider pairs of partition structures (po,pi) such that 
d reduces po to p\, meaning that the following extension of formula (jSJ) holds: 

p (X)d(X,x) = q(n,x)pi(X - {x}) , x e A (A h n) (24) 

where 

q(n, x) := 

d(X,x)p (X) (l<x<n) (25) 

{Ah« : rrGA} 

is the unconditional probability that the deletion rule removes a part of size x from 7r„ distributed 
according to pq. 

Pitman |12) showed that if po 1 pi 1 p2, ■ ■ ■ is a sequence of partition structures such that size-biased 
deletion reduces pi to Pi+\ for each i > 0, and po can be represented in terms of random sampling 
from (Vi, V2, . . •) with Vi > for each i, then po is an (a, 9) partition as in (|10ll for some < a < 1 
and 9 > —a, in which case pj is the (a, 9 + ja) partition structure. This result and Theorem |2 are 
two different two-parameter generalisations of Kingman's characterisation of (0, 9) partition structures. 
In the result of ^21> the deletion kernel is still defined by size-biased sampling, and repeated deletions 
generate a succession of partition structures. Whereas in Theorem [5] the deletion kernel is modified, and 
repeated deletions generate the same partition structure. 

A class of partition structures satisfying (|24|l is associated with Markovian compositions introduced 
in 0] . A composition structure of this type is derived from a set TZ which has a special leftmost interval 
[0, X], and otherwise TZC] [X, 1] is a scaled copy of some other multiplicatively regenerative set TV , which 
is independent of X and has the property (|21fl . For example, the members of the two-parameter family 
with < a < 1, 9 > —a can be associated with such Markovian composition structures. In general, 
such IZ can be represented as a transformed range of a finite-mean subordinator with arbitrary initial 
distribution. 

This discussion leaves open a number of interesting questions. One posed in |1(JI I14| . and apparently 
still open, is the problem of describing all pairs of partition structures (po,Pi) such that po reduces to 
Pi by size-biased deletion. One could ask the same question for other deletion kernels too. But size- 
biased deletion is of special interest because of its natural interpretation in terms of Kingman's paintbox 
construction from ranked random frequencies (Vi, V2, ■ ■ ■)'■ if X n is a sized biased pick from 7r„ derived by 
sampling from such (Vi) associated with the partition structure pa, then X n /n converges in distribution 
to Vi which is a size-biased pick from the limiting frequencies, meaning that 

V(V = V l \V u V 2l ...) = V l 
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V(V = 0\V 1 ,V 2 ,...) = l-'EiVi 

where it is assumed for simplicity that the Vi are almost surely distinct. The probability distribution of V 
on [0, 1], known as the structural distribution encodes many important features of the partition structure 
p . In particular, po(n) = E(P™~ 1 ) for n = 1,2, .. . (see [Q1E1E1)- See also [1^10] for related work. 

Central measures We comment briefly on how our results link to the potential theory on graded 
graphs, as developed by the Russian school in connection with the asymptotic representation theory of 
the symmetric group, see [7] for a survey. 

For A b n, fi b re+1 the number of ways to extend a given partition of [n] with shape A to some partition 
of [re + 1] with shape \x does not depend on the choice of partition of [re]. Denote this number k(A, /i). 
Letting V n denote the set of partitions of re, consider the graded graph with the vertex set V = (J V n 
and multiplicity K(A,/i) for the edge connecting A and /i. A partition structure is a nonnegative solution 
of the recursion 

P( X ) = ^2 k ( A >mMm)> Ahn, 71=1,2,... 

with p(l) = 1. 

For every A b re a directed path in V connecting the root (1) and A encodes a partition of [re] with 
shape A. Thus p defines a a central measure on the space of infinite paths of V, with the property that all 
paths of length n, connecting (1) and A b n, have the same probability depending only on A, for every re 
and AeP„. Each infinite path corresponds to a partition of the set N and the 'centrality' of a measure 
just means that the random partition of N is exchangeable. 

Originally, the concept of central measure emerged in connection with the Young lattice y, which 
has the same vertex set as V, but a different multiplicity function (dictated by the branching rule for 
characters of the symmetric group). According to a classical result of Thoma, central measures on y 
can be described in terms of two random decreasing sequences (Vj), (Wj) (as to be compared with a 
single decreasing sequence (V^) in Kingman's representation). See [S] for particular examples of central 
measures on y, and £Q for a general construction of central measures on graphs which generalise both y 
and V . 

Although the graphs y and V are different, there is a way to push a partition structure on V to a 
central measure on y, analogous to the transition from monomial symmetric functions to Schur symmetric 
functions. It would be interesting to study the image of regenerative partition structures under this 
mapping in some detail. 

General deletion kernels For partitions n and A, interpreted as distributions of unlabeled balls in 
unlabeled boxes, write /i C A if \i can be obtained from A by deletion of one or more boxes. Call a 
function d on pairs of partitions (A, /i) with /i C A, a deletion kernel if it satisfies 

< d < 1 , ^2 d(X, X~) = l. 

A": A-C A 

For each A the deletion kernel defines a random split in two subpartitions A + C A (which remains) and 
A~ C A (which is removed). We can also regard d as transition probability function for a Markov chain 
which jumps from larger partition A to a smaller A + . 

Given d, for a sequence of random partitions (jr n b re, re = 1, 2, . . .) - which in general need not satisfy 
any consistency condition besides that the ir n 's are defined on the same probability space - with each n n 
we associate a split into 7T+ and tt~ (with ir~ removed). This suggests various concepts of invariance, 
depending on how much conditioning is allowed. Thus we could distinguish a weak deletion property 

P(^+=A+|7r-=A-)=P(7r„_ r = A + ), r=|A"|, (26) 

and a strong deletion property 

P(tt+ - A+ | |tt-| = r) = P(7r n _ r = A+) 

where we write |A| for SAj. It would be interesting to extend results of this paper to such general deletion 
operations. 
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